1,633 research outputs found
“Knowledge Utilization” Universities: A Paradigm for Applying Academic Expertise to Social and Environmental Problems
Promoting the utilization of knowledge is an important mission for institutions of higher education. The knowledge utilization university should stand alongside the research university, the professional school, the liberal arts college, and the community college as one of the five archetypes of higher education institutions.
Environmental problems typify a class of social problems that require the utilization of existing knowledge in a trans-disciplinary manner just as much as they require the creation of new knowledge through research. These problems are characterized by their multiple dimensions-- they have scientific, technical, social, political, economic, and ethical aspects, all of which must be taken into account in an integrated way when seeking solutions. Knowledge utilization universities can contribute to solving environmental problems in three ways: by offering interdisciplinary degree programs that address environmental issues; by offering on and off-campus programs that promote environmental literacy; and by involving their faculties in technical assistance and consulting activities with community organizations.
To accomplish this, these institutions must develop organizational cultures that support the mission of promoting knowledge utilization. Cultures that were developed to encourage research or teaching are not likely to be effective in encouraging the activities that promote knowledge utilization
Intelligent Assistance for the Data Mining Process: An Ontology-based Approach
A data mining (DM) process involves multiple stages. A simple, but typical, process might include
preprocessing data, applying a data-mining algorithm, and postprocessing the mining results. There
are many possible choices for each stage, and only some combinations are valid. Because of the
large space and non-trivial interactions, both novices and data-mining specialists need assistance in
composing and selecting DM processes. We present the concept of Intelligent Discovery Assistants
(IDAs), which provide users with (i) systematic enumerations of valid DM processes, in order that
important, potentially fruitful options are not overlooked, and (ii) effective rankings of these valid
processes by different criteria, to facilitate the choice of DM processes to execute. We use a prototype
to show that an IDA can indeed provide useful enumerations and effective rankings. We discuss
how an IDA is an important tool for knowledge sharing among a team of data miners. Finally,
we illustrate all the claims with a comprehensive demonstration using a more involved process and
data from the 1998 KDDCUP competition.Information Systems Working Papers Serie
Towards Intelligent Assistance for a Data Mining Process:-
A data mining (DM) process involves multiple stages. A simple, but typical, process might include
preprocessing data, applying a data-mining algorithm, and postprocessing the mining results.
There are many possible choices for each stage, and only some combinations are valid.
Because of the large space and non-trivial interactions, both novices and data-mining specialists
need assistance in composing and selecting DM processes. Extending notions developed for
statistical expert systems we present a prototype Intelligent Discovery Assistant (IDA), which
provides users with (i) systematic enumerations of valid DM processes, in order that important,
potentially fruitful options are not overlooked, and (ii) effective rankings of these valid processes
by different criteria, to facilitate the choice of DM processes to execute. We use the prototype to
show that an IDA can indeed provide useful enumerations and effective rankings in the context
of simple classification processes. We discuss how an IDA could be an important tool for
knowledge sharing among a team of data miners. Finally, we illustrate the claims with a comprehensive
demonstration of cost-sensitive classification using a more involved process and data
from the 1998 KDDCUP competition.NYU, Stern School of Business, IOMS Department, Center for Digital Economy Researc
Intelligent Assistance for the Data Mining Process: An Ontology-based Approach
A data mining (DM) process involves multiple stages. A simple, but typical, process might include
preprocessing data, applying a data-mining algorithm, and postprocessing the mining results. There
are many possible choices for each stage, and only some combinations are valid. Because of the
large space and non-trivial interactions, both novices and data-mining specialists need assistance in
composing and selecting DM processes. We present the concept of Intelligent Discovery Assistants
(IDAs), which provide users with (i) systematic enumerations of valid DM processes, in order that
important, potentially fruitful options are not overlooked, and (ii) effective rankings of these valid
processes by different criteria, to facilitate the choice of DM processes to execute. We use a prototype
to show that an IDA can indeed provide useful enumerations and effective rankings. We discuss
how an IDA is an important tool for knowledge sharing among a team of data miners. Finally,
we illustrate all the claims with a comprehensive demonstration using a more involved process and
data from the 1998 KDDCUP competition.Information Systems Working Papers Serie
07491 Abstracts Collection -- Mining Programs and Processes
From 02.12. to 17.12.2007, the Dagstuhl Seminar 07491 ``Mining Programs and Processes\u27\u27 was held in Schloss Dagstuhl~--~Leibniz Center for Informatics.
During the seminar, several participants presented their current
research, and ongoing work and open problems were discussed. Abstracts of
the presentations given during the seminar as well as abstracts of
seminar results and ideas are put together in this paper. The first section
describes the seminar topics and goals in general.
Links to extended abstracts or full papers are provided, if available
Data Analytics on Online Labor Markets: Opportunities and Challenges
The data-driven economy has led to a significant shortage of data scientists. To address this shortage, this study explores the prospects of outsourcing data analysis tasks to freelancers available on online labor markets (OLMs) by identifying the essential factors for this endeavor. Specifically, we explore the skills required from freelancers, collect information about the skills present on major OLMs, and identify the main hurdles for out-/crowd-sourcing data analysis. Adopting a sequential mixed-method approach, we interviewed 20 data scientists and subsequently surveyed 80 respondents from OLMs. Besides confirming the need for expected skills such as technical/mathematical capabilities, it also identifies less known ones such as domain understanding, an eye for aesthetic data visualization, good communication skills, and a natural understanding of the possibilities/limitations of data analysis in general. Finally, it elucidates obstacles for crowdsourcing like the communication overhead, knowledge gaps, quality assurance, and data confidentiality, which need to be mitigated
The Relational Vector-space Model
This paper addresses the classification of linked entities. We
introduce a relational vector (VS) model (in analogy to the
VS model used in information retrieval) that abstracts the linked
structure, representing entities by vectors of weights. Given
labeled data as background knowledge training data, classification
procedures can be defined for this model, including a
straightforward, "direct" model using weighted adjacency vectors.
Using a large set of tasks from the domain of company affiliation
identification, we demonstrate that such classification procedures
can be effective. We then examine the method in more detail,
showing that as expected the classification performance correlates
with the- relational auto correlation of the data set. We then turn
the tables and use the relational VS scores as a way to
analyze/visualize the relational autocorrelation present in a
complex linked structure. The main contribution of the paper 1s to
introduce the relational VS model as a potentially useful addition
to the toolkit for relational data mining. It could provide useful
constructed features for domains with low to moderate relational
autocorrelation; it may be effective by itself for domains with high levels of relational autocorrelation, and it provides a useful
abstraction for analyzing the properties of linked data.Information Systems Working Papers Serie
A Multi-Stream Approach for Video Understanding
The automatic annotation of higher-level semantic information in long-form video content is still a challenging task. The Deep Video Understanding (DVU) Challenge aims at catalyzing progress in this area by offering common data and tasks. In this paper, we present our contribution to the 3rd DVU challenge. Our approach consists of multiple information streams extracted from both the visual and the audio modality. The streams can build on information generated by previous streams to increase their semantic descriptiveness. Finally, the output of all streams can be aggregated in order to produce a graph representation of the input movie to represent the semantic relationships between the relevant characters
- …